Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 74690 |
| Missing cells | 288355 |
| Missing cells (%) | 20.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.8 MiB |
| Average record size in memory | 152.0 B |
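The missing-cell percentage above can be cross-checked from figures elsewhere in this report: the total cell count is variables times observations, and the per-variable missing counts in the variable summaries should add up to the reported total. A minimal sketch of that arithmetic, using only numbers copied from this report (the name `missing_per_variable` is an illustrative choice, not part of the profiling tool):

```python
# Figures copied from this report's overview and per-variable summaries.
n_variables = 19
n_observations = 74690

# Missing counts per variable; variables not listed have no missing values.
missing_per_variable = {
    "Common Name": 10,
    "Scientific Name": 1,
    "Genus": 1,
    "Family": 1,
    "Diameter Breast Height": 49268,
    "Year Planted": 8193,
    "Date Planted": 8193,
    "Age Description": 49286,
    "Useful Life Expectency": 49286,
    "Useful Life Expectency Value": 49286,
    "Precinct": 74690,
    "Located in": 140,
}

total_cells = n_variables * n_observations
missing_cells = sum(missing_per_variable.values())
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.1f}%)")
```

The per-variable counts sum to the 288355 missing cells reported in the overview, so the two parts of the report agree.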
Variable types
| CAT | 10 |
|---|---|
| NUM | 8 |
| UNSUPPORTED | 1 |
Warnings
| Warning | Type |
|---|---|
| UploadDate has constant value "01/10/2020" | Constant |
| Common Name has a high cardinality: 570 distinct values | High cardinality |
| Scientific Name has a high cardinality: 535 distinct values | High cardinality |
| Genus has a high cardinality: 168 distinct values | High cardinality |
| Family has a high cardinality: 60 distinct values | High cardinality |
| Date Planted has a high cardinality: 1779 distinct values | High cardinality |
| CoordinateLocation has a high cardinality: 74671 distinct values | High cardinality |
| Easting is highly correlated with Longitude | High correlation |
| Longitude is highly correlated with Easting | High correlation |
| Northing is highly correlated with Latitude | High correlation |
| Latitude is highly correlated with Northing | High correlation |
| Diameter Breast Height has 49268 (66.0%) missing values | Missing |
| Year Planted has 8193 (11.0%) missing values | Missing |
| Date Planted has 8193 (11.0%) missing values | Missing |
| Age Description has 49286 (66.0%) missing values | Missing |
| Useful Life Expectency has 49286 (66.0%) missing values | Missing |
| Useful Life Expectency Value has 49286 (66.0%) missing values | Missing |
| Precinct has 74690 (100.0%) missing values | Missing |
| CoordinateLocation is uniformly distributed | Uniform |
| CoM ID has unique values | Unique |
| Precinct is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2020-10-02 12:43:57.739391 |
|---|---|
| Analysis finished | 2020-10-02 12:44:13.838570 |
| Duration | 16.1 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
CoM ID
Numeric
| Distinct | 74690 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1340215.773 |
| Minimum | 1013381 |
| Maximum | 1787466 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 1013381 |
|---|---|
| 5-th percentile | 1021462.45 |
| Q1 | 1047582.25 |
| median | 1359858.5 |
| Q3 | 1578851.75 |
| 95-th percentile | 1769320.65 |
| Maximum | 1787466 |
| Range | 774085 |
| Interquartile range (IQR) | 531269.5 |
Descriptive statistics
| Standard deviation | 282800.3705 |
|---|---|
| Coefficient of variation (CV) | 0.2110110747 |
| Kurtosis | -1.621929265 |
| Mean | 1340215.773 |
| Median Absolute Deviation (MAD) | 303008 |
| Skewness | 0.1139895432 |
| Sum | 1.00100716e+11 |
| Variance | 7.997604955e+10 |
| Monotonicity | Not monotonic |
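Several of the descriptive statistics in this table are derived from one another. As a rough consistency check, the sketch below recomputes the coefficient of variation, the variance and the sum from the reported mean, standard deviation and observation count. The variable names are illustrative; the inputs are the rounded values printed in this report, so agreement is only expected to several significant figures.

```python
import math

# Values copied from the CoM ID summary in this report.
n_observations = 74690
mean = 1340215.773
std_dev = 282800.3705

# Coefficient of variation: standard deviation relative to the mean.
cv = std_dev / mean
# Variance is the square of the standard deviation.
variance = std_dev ** 2
# The sum of all values equals the mean times the number of observations.
total = mean * n_observations

print(f"CV       = {cv:.10f}")
print(f"Variance = {variance:.9e}")
print(f"Sum      = {total:.8e}")
```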
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1052670 | 1 | < 0.1% | |
| 1051284 | 1 | < 0.1% | |
| 1518256 | 1 | < 0.1% | |
| 1522354 | 1 | < 0.1% | |
| 1782451 | 1 | < 0.1% | |
| 1510068 | 1 | < 0.1% | |
| 1512119 | 1 | < 0.1% | |
| 1524413 | 1 | < 0.1% | |
| 1734677 | 1 | < 0.1% | |
| 1782700 | 1 | < 0.1% | |
| Other values (74680) | 74680 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1013381 | 1 | < 0.1% | |
| 1013382 | 1 | < 0.1% | |
| 1013383 | 1 | < 0.1% | |
| 1013384 | 1 | < 0.1% | |
| 1013385 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1787466 | 1 | < 0.1% | |
| 1787465 | 1 | < 0.1% | |
| 1787464 | 1 | < 0.1% | |
| 1787463 | 1 | < 0.1% | |
| 1787462 | 1 | < 0.1% |
Common Name
Categorical
| Distinct | 570 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 10 |
| Missing (%) | < 0.1% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| River red gum | 7969 | 10.7% | |
| London Plane | 5078 | 6.8% | |
| Drooping sheoak | 3078 | 4.1% | |
| Spotted Gum | 3011 | 4.0% | |
| Yellow Box | 2921 | 3.9% | |
| Black Wattle | 2622 | 3.5% | |
| English Elm | 2212 | 3.0% | |
| Elm | 1737 | 2.3% | |
| Lemon Scented Gum | 1438 | 1.9% | |
| Yellow Gum | 1429 | 1.9% | |
| Other values (560) | 43185 | 57.8% |
Frequencies of value counts
Unique
| Unique | 84 |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 33 |
|---|---|
| Median length | 12 |
| Mean length | 12.70767171 |
| Min length | 3 |
Scientific Name
Categorical
| Distinct | 535 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Eucalyptus camaldulensis | 7969 | 10.7% | |
| Platanus x acerifolia | 5146 | 6.9% | |
| Allocasuarina verticillata | 3078 | 4.1% | |
| Corymbia maculata | 3011 | 4.0% | |
| Eucalyptus melliodora | 2921 | 3.9% | |
| Ulmus procera | 2212 | 3.0% | |
| Ulmus unknown | 1737 | 2.3% | |
| Eucalyptus leucoxylon | 1569 | 2.1% | |
| Corymbia citriodora | 1479 | 2.0% | |
| Acacia mearnsii | 1451 | 1.9% | |
| Other values (525) | 44116 | 59.1% |
Frequencies of value counts
Unique
| Unique | 90 |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 41 |
|---|---|
| Median length | 19 |
| Mean length | 19.17338332 |
| Min length | 3 |
Genus
Categorical
| Distinct | 168 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Eucalyptus | 17211 | 23.0% | |
| Acacia | 6382 | 8.5% | |
| Ulmus | 5921 | 7.9% | |
| Platanus | 5695 | 7.6% | |
| Corymbia | 4792 | 6.4% | |
| Allocasuarina | 3510 | 4.7% | |
| Quercus | 2322 | 3.1% | |
| Casuarina | 1864 | 2.5% | |
| Ficus | 1622 | 2.2% | |
| Melaleuca | 1516 | 2.0% | |
| Other values (158) | 23854 | 31.9% |
Frequencies of value counts
Unique
| Unique | 24 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 8.270384255 |
| Min length | 3 |
Family
Categorical
| Distinct | 60 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Myrtaceae | 28470 | 38.1% | |
| Fabaceae | 6917 | 9.3% | |
| Ulmaceae | 6644 | 8.9% | |
| Platanaceae | 5695 | 7.6% | |
| Casuarinaceae | 5374 | 7.2% | |
| Fagaceae | 2330 | 3.1% | |
| Proteaceae | 1996 | 2.7% | |
| Sapindaceae | 1754 | 2.3% | |
| Moraceae | 1635 | 2.2% | |
| Pittosporaceae | 1381 | 1.8% | |
| Other values (50) | 12493 | 16.7% |
Frequencies of value counts
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 18 |
|---|---|
| Median length | 9 |
| Mean length | 9.584616415 |
| Min length | 3 |
Diameter Breast Height
Numeric
| Distinct | 220 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 49268 |
| Missing (%) | 66.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.59649123 |
| Minimum | 0 |
| Maximum | 347 |
| Zeros | 255 |
| Zeros (%) | 0.3% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 14 |
| median | 29 |
| Q3 | 56 |
| 95-th percentile | 97 |
| Maximum | 347 |
| Range | 347 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 31.55417029 |
|---|---|
| Coefficient of variation (CV) | 0.8392849773 |
| Kurtosis | 3.599336201 |
| Mean | 37.59649123 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | 1.45523555 |
| Sum | 955778 |
| Variance | 995.6656625 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 4 | 608 | 0.8% | |
| 5 | 606 | 0.8% | |
| 6 | 587 | 0.8% | |
| 3 | 563 | 0.8% | |
| 7 | 522 | 0.7% | |
| 15 | 504 | 0.7% | |
| 19 | 490 | 0.7% | |
| 17 | 488 | 0.7% | |
| 10 | 476 | 0.6% | |
| 20 | 476 | 0.6% | |
| Other values (210) | 20102 | 26.9% | |
| (Missing) | 49268 | 66.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0 | 255 | 0.3% | |
| 1 | 139 | 0.2% | |
| 2 | 359 | 0.5% | |
| 3 | 563 | 0.8% | |
| 4 | 608 | 0.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 347 | 1 | < 0.1% | |
| 310 | 1 | < 0.1% | |
| 283 | 1 | < 0.1% | |
| 273 | 1 | < 0.1% | |
| 268 | 1 | < 0.1% |
Year Planted
Numeric
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8193 |
| Missing (%) | 11.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1998.997489 |
| Minimum | 1899 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 1899 |
|---|---|
| 5-th percentile | 1900 |
| Q1 | 1998 |
| median | 2008 |
| Q3 | 2015 |
| 95-th percentile | 2019 |
| Maximum | 2020 |
| Range | 121 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 30.4362635 |
|---|---|
| Coefficient of variation (CV) | 0.01522576375 |
| Kurtosis | 6.091514537 |
| Mean | 1998.997489 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -2.699518887 |
| Sum | 132927336 |
| Variance | 926.366136 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1998 | 12252 | 16.4% | |
| 1997 | 6662 | 8.9% | |
| 1900 | 5323 | 7.1% | |
| 2019 | 3883 | 5.2% | |
| 2015 | 3753 | 5.0% | |
| 2017 | 3238 | 4.3% | |
| 2012 | 3147 | 4.2% | |
| 2006 | 3079 | 4.1% | |
| 1999 | 3041 | 4.1% | |
| 2020 | 2848 | 3.8% | |
| Other values (14) | 19271 | 25.8% | |
| (Missing) | 8193 | 11.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1899 | 48 | 0.1% | |
| 1900 | 5323 | 7.1% | |
| 1977 | 1 | < 0.1% | |
| 1997 | 6662 | 8.9% | |
| 1998 | 12252 | 16.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 2020 | 2848 | 3.8% | |
| 2019 | 3883 | 5.2% | |
| 2018 | 2584 | 3.5% | |
| 2017 | 3238 | 4.3% | |
| 2016 | 2502 | 3.3% |
Date Planted
Categorical
| Distinct | 1779 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 8193 |
| Missing (%) | 11.0% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 01/01/2000 | 3268 | 4.4% | |
| 02/01/2000 | 3260 | 4.4% | |
| 27/04/2012 | 999 | 1.3% | |
| 04/07/2009 | 660 | 0.9% | |
| 14/12/1998 | 582 | 0.8% | |
| 16/12/1998 | 578 | 0.8% | |
| 17/12/1998 | 540 | 0.7% | |
| 12/11/1998 | 496 | 0.7% | |
| 12/03/1998 | 490 | 0.7% | |
| 13/01/1999 | 475 | 0.6% | |
| Other values (1769) | 55149 | 73.8% | |
| (Missing) | 8193 | 11.0% |
Frequencies of value counts
Unique
| Unique | 252 |
|---|---|
| Unique (%) | 0.4% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.232146204 |
| Min length | 3 |
Age Description
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49286 |
| Missing (%) | 66.0% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Semi-Mature | 10632 | 14.2% | |
| Mature | 9025 | 12.1% | |
| Juvenile | 4027 | 5.4% | |
| New | 1423 | 1.9% | |
| Over-mature | 297 | 0.4% | |
| (Missing) | 49286 | 66.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 11 |
|---|---|
| Median length | 3 |
| Mean length | 4.802677735 |
| Min length | 3 |
Useful Life Expectency
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49286 |
| Missing (%) | 66.0% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 31-60 years | 7363 | 9.9% | |
| 21-30 years | 5004 | 6.7% | |
| 11-20 years | 4311 | 5.8% | |
| 61+ years | 3163 | 4.2% | |
| 6-10 years (>50% canopy) | 1870 | 2.5% | |
| 1-5 years (<50% canopy) | 1539 | 2.1% | |
| 6-10 years (<50% canopy) | 1375 | 1.8% | |
| 1-5 years (>50% canopy) | 335 | 0.4% | |
| <1 year | 317 | 0.4% | |
| 1-5 years | 127 | 0.2% | |
| (Missing) | 49286 | 66.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 24 |
|---|---|
| Median length | 3 |
| Mean length | 6.481818182 |
| Min length | 3 |
Useful Life Expectency Value
Numeric
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49286 |
| Missing (%) | 66.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.33774209 |
| Minimum | 1 |
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 20 |
| median | 30 |
| Q3 | 60 |
| 95-th percentile | 80 |
| Maximum | 80 |
| Range | 79 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 25.10811028 |
|---|---|
| Coefficient of variation (CV) | 0.6549188582 |
| Kurtosis | -1.331661476 |
| Mean | 38.33774209 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.2726180147 |
| Sum | 973932 |
| Variance | 630.4172016 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 60 | 7363 | 9.9% | |
| 30 | 5004 | 6.7% | |
| 20 | 4311 | 5.8% | |
| 10 | 3245 | 4.3% | |
| 80 | 3163 | 4.2% | |
| 5 | 2001 | 2.7% | |
| 1 | 317 | 0.4% | |
| (Missing) | 49286 | 66.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 1 | 317 | 0.4% | |
| 5 | 2001 | 2.7% | |
| 10 | 3245 | 4.3% | |
| 20 | 4311 | 5.8% | |
| 30 | 5004 | 6.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 80 | 3163 | 4.2% | |
| 60 | 7363 | 9.9% | |
| 30 | 5004 | 6.7% | |
| 20 | 4311 | 5.8% | |
| 10 | 3245 | 4.3% |
Located in
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 140 |
| Missing (%) | 0.2% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| Park | 44565 | 59.7% | |
| Street | 29985 | 40.1% | |
| (Missing) | 140 | 0.2% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.801044317 |
| Min length | 3 |
UploadDate
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 01/10/2020 | 74690 | 100.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
CoordinateLocation
Categorical
| Distinct | 74671 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 583.5 KiB |
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| (-37.7914833480749, 144.94707053642645) | 2 | < 0.1% | |
| (-37.78241357208308, 144.94903799941775) | 2 | < 0.1% | |
| (-37.78402113407502, 144.95636067748285) | 2 | < 0.1% | |
| (-37.78357828352511, 144.95565900304265) | 2 | < 0.1% | |
| (-37.78412681859444, 144.95657225912905) | 2 | < 0.1% | |
| (-37.7840762536616, 144.95582911550218) | 2 | < 0.1% | |
| (-37.785400121796584, 144.95432601989748) | 2 | < 0.1% | |
| (-37.78344686646531, 144.95543950381864) | 2 | < 0.1% | |
| (-37.783626795574406, 144.95572295612783) | 2 | < 0.1% | |
| (-37.783743992045764, 144.9557514065785) | 2 | < 0.1% | |
| Other values (74661) | 74670 | > 99.9% |
Frequencies of value counts
Unique
| Unique | 74652 |
|---|---|
| Unique (%) | 99.9% |
Histogram of lengths of the category
Length
| Max length | 41 |
|---|---|
| Median length | 40 |
| Mean length | 39.89732227 |
| Min length | 35 |
Latitude
Numeric
| Distinct | 74671 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -37.80134838 |
| Minimum | -37.85053172 |
| Maximum | -37.77551137 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | -37.85053172 |
|---|---|
| 5-th percentile | -37.83022459 |
| Q1 | -37.81409888 |
| median | -37.79674565 |
| Q3 | -37.78883641 |
| 95-th percentile | -37.78012744 |
| Maximum | -37.77551137 |
| Range | 0.07502034875 |
| Interquartile range (IQR) | 0.02526246243 |
Descriptive statistics
| Standard deviation | 0.01633835597 |
|---|---|
| Coefficient of variation (CV) | -0.0004322162218 |
| Kurtosis | -0.5294818137 |
| Mean | -37.80134838 |
| Median Absolute Deviation (MAD) | 0.01092711602 |
| Skewness | -0.6245853954 |
| Sum | -2823382.711 |
| Variance | 0.000266941876 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| -37.78241357 | 2 | < 0.1% | |
| -37.7836268 | 2 | < 0.1% | |
| -37.78340298 | 2 | < 0.1% | |
| -37.78327011 | 2 | < 0.1% | |
| -37.78402113 | 2 | < 0.1% | |
| -37.7841318 | 2 | < 0.1% | |
| -37.7868515 | 2 | < 0.1% | |
| -37.78344687 | 2 | < 0.1% | |
| -37.78374399 | 2 | < 0.1% | |
| -37.78326289 | 2 | < 0.1% | |
| Other values (74661) | 74670 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| -37.85053172 | 1 | < 0.1% | |
| -37.85051819 | 1 | < 0.1% | |
| -37.85049915 | 1 | < 0.1% | |
| -37.85048587 | 1 | < 0.1% | |
| -37.8504845 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| -37.77551137 | 1 | < 0.1% | |
| -37.77551839 | 1 | < 0.1% | |
| -37.77552351 | 1 | < 0.1% | |
| -37.77553446 | 1 | < 0.1% | |
| -37.77554112 | 1 | < 0.1% |
Longitude
Numeric
| Distinct | 74671 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 144.9516034 |
| Minimum | 144.9004497 |
| Maximum | 144.9910558 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 144.9004497 |
|---|---|
| 5-th percentile | 144.9164628 |
| Q1 | 144.94286 |
| median | 144.9514732 |
| Q3 | 144.9638143 |
| 95-th percentile | 144.982993 |
| Maximum | 144.9910558 |
| Range | 0.0906060254 |
| Interquartile range (IQR) | 0.02095434501 |
Descriptive statistics
| Standard deviation | 0.0188309371 |
|---|---|
| Coefficient of variation (CV) | 0.0001299118924 |
| Kurtosis | -0.2996784642 |
| Mean | 144.9516034 |
| Median Absolute Deviation (MAD) | 0.009768643564 |
| Skewness | -0.1872139784 |
| Sum | 10826435.25 |
| Variance | 0.0003546041921 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 144.954326 | 2 | < 0.1% | |
| 144.9554552 | 2 | < 0.1% | |
| 144.9629381 | 2 | < 0.1% | |
| 144.9558291 | 2 | < 0.1% | |
| 144.955659 | 2 | < 0.1% | |
| 144.955723 | 2 | < 0.1% | |
| 144.9565723 | 2 | < 0.1% | |
| 144.9553334 | 2 | < 0.1% | |
| 144.9550477 | 2 | < 0.1% | |
| 144.9561377 | 2 | < 0.1% | |
| Other values (74661) | 74670 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 144.9004497 | 1 | < 0.1% | |
| 144.9004721 | 1 | < 0.1% | |
| 144.9004768 | 1 | < 0.1% | |
| 144.9004949 | 1 | < 0.1% | |
| 144.9005248 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 144.9910558 | 1 | < 0.1% | |
| 144.99104 | 1 | < 0.1% | |
| 144.9910391 | 1 | < 0.1% | |
| 144.9910214 | 1 | < 0.1% | |
| 144.991011 | 1 | < 0.1% |
Easting
Numeric
| Distinct | 68570 |
|---|---|
| Distinct (%) | 91.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 319662.4184 |
| Minimum | 315230.55 |
| Maximum | 323159.33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 315230.55 |
|---|---|
| 5-th percentile | 316556.1465 |
| Q1 | 318879.015 |
| median | 319627.51 |
| Q3 | 320753.6725 |
| 95-th percentile | 322495.971 |
| Maximum | 323159.33 |
| Range | 7928.78 |
| Interquartile range (IQR) | 1874.6575 |
Descriptive statistics
| Standard deviation | 1673.397429 |
|---|---|
| Coefficient of variation (CV) | 0.005234889473 |
| Kurtosis | -0.3263315695 |
| Mean | 319662.4184 |
| Median Absolute Deviation (MAD) | 865.03 |
| Skewness | -0.1402760251 |
| Sum | 2.387558603e+10 |
| Variance | 2800258.956 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 319033.4 | 5 | < 0.1% | |
| 319522.3 | 5 | < 0.1% | |
| 319719.7 | 5 | < 0.1% | |
| 319724.45 | 5 | < 0.1% | |
| 319365.04 | 4 | < 0.1% | |
| 319451.96 | 4 | < 0.1% | |
| 320339.81 | 4 | < 0.1% | |
| 319626.5 | 4 | < 0.1% | |
| 320339.09 | 4 | < 0.1% | |
| 319195.79 | 4 | < 0.1% | |
| Other values (68560) | 74646 | 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 315230.55 | 1 | < 0.1% | |
| 315232.11 | 1 | < 0.1% | |
| 315232.43 | 1 | < 0.1% | |
| 315234.54 | 1 | < 0.1% | |
| 315236.94 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 323159.33 | 1 | < 0.1% | |
| 323157.94 | 1 | < 0.1% | |
| 323157.89 | 1 | < 0.1% | |
| 323156.69 | 1 | < 0.1% | |
| 323156.32 | 1 | < 0.1% |
Northing
Numeric
| Distinct | 68878 |
|---|---|
| Distinct (%) | 92.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5814248.917 |
| Minimum | 5808853.08 |
| Maximum | 5817092.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 583.5 KiB |
Quantile statistics
| Minimum | 5808853.08 |
|---|---|
| 5-th percentile | 5811062.2 |
| Q1 | 5812868.442 |
| median | 5814736.7 |
| Q3 | 5815629.835 |
| 95-th percentile | 5816604.429 |
| Maximum | 5817092.14 |
| Range | 8239.06 |
| Interquartile range (IQR) | 2761.3925 |
Descriptive statistics
| Standard deviation | 1799.528103 |
|---|---|
| Coefficient of variation (CV) | 0.0003095031066 |
| Kurtosis | -0.5455257753 |
| Mean | 5814248.917 |
| Median Absolute Deviation (MAD) | 1215.125 |
| Skewness | -0.60924807 |
| Sum | 4.342662516e+11 |
| Variance | 3238301.392 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5814938.17 | 6 | < 0.1% | |
| 5814937.89 | 5 | < 0.1% | |
| 5815513.55 | 5 | < 0.1% | |
| 5815525.17 | 5 | < 0.1% | |
| 5814936.95 | 4 | < 0.1% | |
| 5816395.06 | 4 | < 0.1% | |
| 5815529.74 | 4 | < 0.1% | |
| 5816440.76 | 4 | < 0.1% | |
| 5815460.99 | 4 | < 0.1% | |
| 5814075.61 | 4 | < 0.1% | |
| Other values (68868) | 74645 | 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5808853.08 | 1 | < 0.1% | |
| 5808854.38 | 1 | < 0.1% | |
| 5808856.25 | 1 | < 0.1% | |
| 5808857.56 | 1 | < 0.1% | |
| 5808857.59 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 5817092.14 | 1 | < 0.1% | |
| 5817091.23 | 1 | < 0.1% | |
| 5817090.92 | 1 | < 0.1% | |
| 5817089.83 | 1 | < 0.1% | |
| 5817089.23 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the angle of the line to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
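The calculation described above can be sketched in a few lines of plain Python. This is an illustrative implementation, not the code the profiling tool runs; the function name `pearson_r` is hypothetical, and the sample data are the Longitude and Easting values of the ten "First rows" records in this report, a pair the report flags as highly correlated.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the standard deviations cancel,
    # so plain sums of deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Longitude and Easting of the ten "First rows" records in this report.
longitude = [144.917392, 144.955972, 144.918438, 144.954423, 144.965000,
             144.929757, 144.943597, 144.946753, 144.944689, 144.983760]
easting = [316635.33, 320033.39, 316791.08, 319865.88, 320892.83,
           317708.91, 319004.20, 319207.14, 319084.63, 322554.83]

r = pearson_r(longitude, easting)
print(f"Pearson's r (Longitude vs Easting, 10 rows): {r:.4f}")
```

On this small sample r comes out very close to +1, in line with the high-correlation warning for the two columns.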
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
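As a rough illustration of that definition, the sketch below ranks each variable (ties receive the mean of their ranks, which is an assumption about tie handling) and then applies the Pearson formula to the ranks. The function names and the cubic toy data are illustrative only.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A monotonic but nonlinear relationship: y grows as the cube of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(f"Spearman's rho = {rho:.4f}, Pearson's r = {r:.4f}")
```

Because the relationship is perfectly monotonic, ρ reaches 1 while r stays below 1, which is the difference the paragraph above describes.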
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
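The pair-counting definition above corresponds to what is usually called tau-a. The sketch below implements it directly; library implementations often use a tie-adjusted variant, so their results may differ when ties are present. The function name `kendall_tau_a` is hypothetical, and the sample data are the Latitude and Northing values of the first five "First rows" records in this report.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1  # both variables move the same way
        elif direction < 0:
            discordant += 1  # the variables move in opposite ways
        # Pairs tied in either variable count as neither.
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Latitude and Northing of the first five "First rows" records in this report.
latitude = [-37.795283, -37.795668, -37.821039, -37.782863, -37.822347]
northing = [5814855.56, 5814887.79, 5811999.52, 5816305.74, 5811944.75]

tau = kendall_tau_a(latitude, northing)
print(f"Kendall's tau-a (Latitude vs Northing, 5 rows): {tau:.1f}")
```

Of the ten possible pairs, nine are concordant and one (the first two rows) is discordant, so τ is 0.8 on this sample.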
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. The report uses a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | CoM ID | Common Name | Scientific Name | Genus | Family | Diameter Breast Height | Year Planted | Date Planted | Age Description | Useful Life Expectency | Useful Life Expectency Value | Precinct | Located in | UploadDate | CoordinateLocation | Latitude | Longitude | Easting | Northing |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1492924 | Flame Tree | Brachychiton acerifolius | Brachychiton | Malvaceae | NaN | 2011.0 | 12/08/2011 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.795282936672386, 144.9173922130217) | -37.795283 | 144.917392 | 316635.33 | 5814855.56 |
| 1 | 1019881 | Golden Honeylocust | Gleditsia triacanthos | Gleditsia | Fabaceae | 16.0 | 1900.0 | 02/01/2000 | Semi-Mature | 31-60 years | 60.0 | NaN | Street | 01/10/2020 | (-37.79566841931897, 144.95597171021834) | -37.795668 | 144.955972 | 320033.39 | 5814887.79 |
| 2 | 1031022 | Spotted Gum | Corymbia maculata | Corymbia | Myrtaceae | 31.0 | 1997.0 | 12/12/1997 | Semi-Mature | 31-60 years | 60.0 | NaN | Street | 01/10/2020 | (-37.8210392069869, 144.91843779301422) | -37.821039 | 144.918438 | 316791.08 | 5811999.52 |
| 3 | 1050432 | Black Wattle | Acacia mearnsii | Acacia | Fabaceae | NaN | 1998.0 | 18/12/1998 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.78286326814045, 144.95442254132493) | -37.782863 | 144.954423 | 319865.88 | 5816305.74 |
| 4 | 1029800 | Variegated Elm | Ulmus minor | Ulmus | Ulmaceae | 22.0 | 1997.0 | 12/08/1997 | Semi-Mature | 11-20 years | 20.0 | NaN | Street | 01/10/2020 | (-37.822346761279306, 144.96499971222198) | -37.822347 | 144.965000 | 320892.83 | 5811944.75 |
| 5 | 1286590 | London Plane | Platanus x acerifolia | Platanus | Platanaceae | 59.0 | NaN | NaN | Mature | 1-5 years (>50% canopy) | 5.0 | NaN | Street | 01/10/2020 | (-37.789091353254534, 144.92975715633244) | -37.789091 | 144.929757 | 317708.91 | 5815566.81 |
| 6 | 1781809 | Wattle | Acacia unknown | Acacia | Fabaceae | NaN | 2020.0 | 19/03/2020 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.820435683194596, 144.94359660008945) | -37.820436 | 144.943597 | 319004.20 | 5812115.55 |
| 7 | 1048800 | Lemon Bottlebrush | Callistemon pallidus | Callistemon | Myrtaceae | NaN | 1998.0 | 15/12/1998 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.78971502333605, 144.94675314698821) | -37.789715 | 144.946753 | 319207.14 | 5815530.62 |
| 8 | 1525508 | Norfolk Island Pine | Araucaria heterophylla | Araucaria | Araucariaceae | NaN | 2012.0 | 27/04/2012 | NaN | NaN | NaN | NaN | Street | 01/10/2020 | (-37.81398704347387, 144.944689194818) | -37.813987 | 144.944689 | 319084.63 | 5812833.25 |
| 9 | 1784818 | Red Pokers | Hakea bucculenta | Hakea | Proteaceae | NaN | 2020.0 | 23/06/2020 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.82681988077671, 144.98375951144155) | -37.826820 | 144.983760 | 322554.83 | 5811484.20 |
Last rows
| | CoM ID | Common Name | Scientific Name | Genus | Family | Diameter Breast Height | Year Planted | Date Planted | Age Description | Useful Life Expectency | Useful Life Expectency Value | Precinct | Located in | UploadDate | CoordinateLocation | Latitude | Longitude | Easting | Northing |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 74680 | 1034163 | Huntingdon Elm | Ulmus x hollandica | Ulmus | Ulmaceae | 66.0 | 1998.0 | 24/09/1998 | Mature | <1 year | 1.0 | NaN | Park | 01/10/2020 | (-37.82700832792828, 144.98381277992095) | -37.827008 | 144.983813 | 322559.97 | 5811463.39 |
| 74681 | 1036571 | English Elm | Ulmus procera | Ulmus | Ulmaceae | 110.0 | 1998.0 | 27/10/1998 | Mature | 11-20 years | 20.0 | NaN | Park | 01/10/2020 | (-37.80134922468855, 144.9730968954069) | -37.801349 | 144.973097 | 321555.01 | 5814290.26 |
| 74682 | 1046335 | River red gum | Eucalyptus camaldulensis | Eucalyptus | Myrtaceae | NaN | 1998.0 | 12/07/1998 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.788357846785786, 144.95262311600283) | -37.788358 | 144.952623 | 319720.76 | 5815692.56 |
| 74683 | 1604096 | Chinese Elm | Ulmus parvifolia | Ulmus | Ulmaceae | NaN | 2015.0 | 14/08/2015 | NaN | NaN | NaN | NaN | Street | 01/10/2020 | (-37.820559637253, 144.9445744716103) | -37.820560 | 144.944574 | 319090.58 | 5812103.69 |
| 74684 | 1440058 | English Oak | Quercus robur | Quercus | Fagaceae | NaN | 2009.0 | 12/10/2009 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.8310841406414, 144.97480645272424) | -37.831084 | 144.974806 | 321777.07 | 5810993.96 |
| 74685 | 1015316 | London Plane | Platanus x acerifolia | Platanus | Platanaceae | 36.0 | 1900.0 | 02/01/2000 | Semi-Mature | 11-20 years | 20.0 | NaN | Street | 01/10/2020 | (-37.797173995067354, 144.95074876642542) | -37.797174 | 144.950749 | 319577.15 | 5814710.65 |
| 74686 | 1656789 | Turkey Oak | Quercus cerris | Quercus | Fagaceae | NaN | 2017.0 | 26/04/2017 | NaN | NaN | NaN | NaN | Street | 01/10/2020 | (-37.8010568898896, 144.96525358940465) | -37.801057 | 144.965254 | 320863.72 | 5814307.69 |
| 74687 | 1635035 | Prickly-leaved paperbark | Melaleuca nodosa | Melaleuca | Myrtaceae | NaN | 2016.0 | 08/06/2016 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.82092659909152, 144.94676981194382) | -37.820927 | 144.946770 | 319284.72 | 5812067.22 |
| 74688 | 1584555 | Swamp Sheoak | Casuarina obesa | Casuarina | Casuarinaceae | NaN | 2015.0 | 18/06/2015 | NaN | NaN | NaN | NaN | Park | 01/10/2020 | (-37.79512590199677, 144.95222072733162) | -37.795126 | 144.952221 | 319701.78 | 5814940.76 |
| 74689 | 1025972 | English Elm | Ulmus procera | Ulmus | Ulmaceae | 116.0 | 1997.0 | 24/11/1997 | Mature | 11-20 years | 20.0 | NaN | Park | 01/10/2020 | (-37.80981079456008, 144.9899121616535) | -37.809811 | 144.989912 | 323055.75 | 5813383.30 |